How Bad Do You Spell?: The Lexical Quality of Social Media

نویسندگان

  • Ricardo A. Baeza-Yates
  • Luz Rello
چکیده

In this study we present an analysis of the lexical quality of social media in the Web, focusing on the Web 2.0, social networks, blogs and micro-blogs, multimedia and opinions. We find that blogs and social networks are the main players and also the main contributors to the bad lexical quality of the Web. We also compare our results with the rest of the Web finding that in general social media has worse lexical quality than the average Web and that their quality is one order of magnitude worse than high quality sites.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Social Media Is NOT that Bad! The Lexical Quality of Social Media

There is a strong correlation between spelling errors and web text content quality. Using our lexical quality measure, based in a small corpus of spelling errors, we present an estimation of the lexical quality of the main Social Media sites. This paper presents an updated and complete analysis of the lexical quality of Social Media written in English and Spanish, including how lexical quality ...

متن کامل

Investigating the Social Practice of Persian Translations of ‘The Girl You Left Behind’ through Translators’ Lexical and Grammatical Strategies

The present study aimed to shed light upon the differences of social practice of Persian translations of The Girl You Left Behind written by Jojo Moyes (2012) with original text in English based on Fairclough's (1995) model. In this regard, through a careful analysis of the source and target texts, English social prac- tice instances were selected along with their Persian equivalents as the cor...

متن کامل

درآمدی بر مبنای مکان یابی و طراحی بیمارستان ها

Background: The hospital is an important element in the new public health. The health in the populations requires access to the medical and hospital services as well as preventive care and a healthy environment. This study attempts to review the important factors to be considered in the hospital sites selected and design in the urban, regional and country levels. Finally, suggestions have exhib...

متن کامل

What to do about bad language on the internet

The rise of social media has brought computational linguistics in ever-closer contact with bad language: text that defies our expectations about vocabulary, spelling, and syntax. This paper surveys the landscape of bad language, and offers a critical review of the NLP community’s response, which has largely followed two paths: normalization and domain adaptation. Each approach is evaluated in t...

متن کامل

Factors Affecting Social Commerce and Exploring the Mediating Role of Perceived Risk (Case Study: Social Media Users in Isfahan)

Owing to the ever-increasing prevalence of social media use, social commerce has become an important part of e-commerce. This study endeavors to explore the impact of social media quality and social support on the social commerce (SC) intention directly and through the variable of perceived risk. The sample included 214 social media users in Isfahan collected through simple random sampling meth...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011